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Abstract 

In this paper, we present a new exact algorithm for counting perfect matchings, which re- 
lies on neither inclusion-exclusion principle nor tree-decompositions. For any bipartite graph 
of 2n nodes and An edges such that A > 3, our algorithm runs with O*(20-l/O(Alog A))n) 
time and exponential space. Compared to the previous algorithms, it achieves a better 
time bound in the sense that the performance degradation to the increase of A is quite 
slower. The main idea of our algorithm is a new reduction to the problem of computing 
the cut-weight distribution of the input graph. The primary ingredient of this reduction is 
Mac Williams Identity derived from elementary coding theory. The whole of our algorithm 
is designed by combining that reduction with a non-trivial fast algorithm computing the 
cut-weight distribution. To the best of our knowledge, the approach posed in this paper is 
new and may be of independent interest. 

1 Introduction 

Counting perfect matchings in given input graph G is recognized as one of hard combinatorial 
problems. In particular, the case that G is bipartite has attracted much attention with its long 
history because of the relation to the computation of permanent, which is a characteristic value 
of matrices with many important applications. Since counting perfect matchings for bipartite 
graphs belongs to ^P-complete, there seems to be no algorithm which runs within polynomial 
time for any input. Thus all of the previous studies lies on one (or more) of the following 
directions: Approximation, restriction of input graphs, or exact exponential algorithms. In this 
paper, we focus on the third line. 

A seminal exponential-time algorithm for counting perfect matchings is Ryser's one based 
on the inclusion-exclusion principle [25]. For any bipartite graph G of 2n vertices, it counts 
perfect matchings with 0*(2 n ) time 1 and polynomial memory space. There has been several 
improvements following that work: Bax and Franklin have shown an algorithm running with 
(3*(2( 1-1 /°( n2/ ' 3 lnn )) n ) expected time and exponential space [3]. Servedio and Wan have given 
an algorithm with a time upper bound depending on the average degree A [26]. It achieves 
0*^2(i-i/O(exp(A)))n>j time anc j polynomial space. Another approach to this problem is the 
usage of tree decompositions [2, 24]. By combining the fact that sparse graphs have a treewidth 
less than (1 — e)n for some constant e (e.g., if A < 3, e ~ 5/6 holds [10]), we can obtain 
an algorithm running 0*(2( 1_e ) n ) time. All of these algorithms break 0*(2 n )-time barrier in 
some sense. However, during last 50 years, there has been proposed no algorithm achieving 
exponential-time speedup for any graph, which is a big open problem in this topic. 
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Our result presented in this paper can be put on the same line. The main contribution 
is to propose a new algorithm for counting perfect matchings. For any bipartite graph of 2n 
nodes and An edges, it runs with 0*(2( 1_1 /°( A log A )) n ) time and exponential space. While 
this algorithm does not settle the open problem stated above, its speed-up factor becomes 
substantially closer to the exponential compared to the previous algorithms. 

An important remark is that the approach we adopt is quite different from any previous 
solutions. It relies on neither inclusion-exclusion nor tree decomposition. Actually, the main 
idea is an extremely-simple reduction to the problem of computing the cut-weight distribution 
of the input graph. The precise construction of our algorithm can be summarized as follows: 

• For any odd input bipartite graph G of 2n nodes and m edges, we can show that the 
number of G"s perfect matchings is equal to the number of elements with weight m — n 
in its cycle space. In addition, for any bipartite graph G, it is possible to construct the 
odd bipartite graph G which has the same number of perfect matchings as G, by adding 
a constant number of nodes. 

• By utilizing the primal-dual relation between cycle space and cut space, we can reduce 
the problem of counting cycle-space elements with weight m-nto computing the weight 
distribution of the cut space. The technical tool behind this reduction is the use of 
MacWilliams identity, which is a well-known theorem derived from elementary coding- 
theory. That identity provides the linear transformation (by so-called Krawtchouk matri- 
ces) that maps the weight-distribution vector of any cut space to the corresponding cycle 
space. 

• Since the cardinality of the cut space is vertex-exponential, it is easy to construct a 
naive algorithm with 0*(2 2n ) running time. We improve its running time by utilizing the 
bipartiteness property and a novel technique analogous to separator decompositions. 

It should be noted that except for the last step, our approach is applicable to any graphs 
which may not be bipartite. Our reduction technique can be seen as an algebraic approach 
to the design of exact algorithms as considered in [7,19], where several kinds of algebraic 
transformations are used for appropriate handling of target universes. To the best of our 
knowledge, this is the first attempt using the transformation by MacWilliams Identity (or 
equivalently Kratwtchouk matrices) for that objective. 

The organization of the paper is as follows: We first presents several notions and definitions 
in Section 2, which includes an tiny tutorial of linear codes. Section 3 introduces our reduction 
to cut space. The algorithm to compute the cut-weight distribution is shown in Section 4 gives 
an algorithm computing the cut space. We mention the related work in Section 5, and finally 
conclude the paper in Section 6 with the open problems posed by our result. 

2 Preliminaries from Coding Theory 

A linear code C over F2 defined bynxm matrix M is the set of m-dimensional vectors as 
follows: 

C = {vM\v €F£}. 

The matrix M is called the generator matrix of C. By the definition, code C is the linear 
subspace of F™ spanned by the row vectors of M. The rank of that subspace is denoted by 
rank(M). Clearly, the number of codewords in C (denoted by \C\) is equal to 2 rank<yM \ A 
(m, r)-linear code is the one such that the length of codewords is m and its rank is r. 
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Let C be a linear code with generator matrix M. The parity check matrix H of C is the 
m x (m — rank(M)) matrix satisfying Hw T = for any codeword w G C. It is well-known 
that there is a duality between generator matrices and parity check matrices: For the code C ± 
with generator matrix H, it is easily verified that v T M = holds for any v G C L . That is, M 
is the parity check matrix of C-^. Then the code C 1 - is called the dual code of C . Obviously 
v T v 1 - = holds for any v G C and d -1 G C -1 . It implies that the dual code is the orthogonal 
complement of the primary code. 

Given a codeword w, the number of appearance of value 1 in to is called the weight of 
w. The weight distribution of a (m, r)-linear code C is the m-dimensional vector whose A;-th 
entry Wcffc] is the number of codewords with weight k in C. The weight distribution is often 
represented as the form of generating functions Fq(x) = Ylw=o Wc[w]x w . This function is called 
the weight-distribution polynomial of C. There is a well-known theorem providing a relationship 
between the weight-distribution polynomials of primary and dual codes: 

Theorem 1 (Mac Williams Identity [20]) Let C be a (m,r)-linear code over ¥2 and be 
its dual. Then, the following identity holds: 

F C w = i(i+*rF ci (I^). 

By comparing the coefficient of each monomial in both sides, we have the representation of 
Wc [k] by a linear sum of the weight distribution of C 1 - : 

^ TO 

Wc[i\ = ¥ Y / K ^,i)W c ±\j], (1) 
3=0 

where K m (j,i) is the value known as Krawtchouk polynomials, defined as follows: 

^>=|><)(?:'). 

3 Counting Perfect Matchings via Cycle Space 
3.1 Cut and Cycle Spaces 

In this section any arithmetic operation for elements of vectors and matrices is over field F2. Let- 
ting G = (V, E) be an undirected graph with n vertices v±, V2, ■ ■ ■ ,v n and m edges e±, e2, • • • , e m , 
its incidence matrix A G = (Afj) G F^*™ is the one such that Af- = 1 if and only if Vi is inci- 
dent to ej and Af - = otherwise. It is easy to check that the i-th row of A G is the 0-1 vector 
representation of the set of edges incident to Vi. Given a 0-1 (row) vector representation of vs 
for a vertex subset S C V, v$A G is the cutset between S and V \ S. It implies that the linear 
code defined by the generator matrix A G is equivalent to the family of edge subsets each of 
which represents a cutset, so-called the cut space of G. 

As an well-known fact, the set of all cycles in G induces a linear subspace of F™, where each 
element is a 0-1 vector representation of the edge set constituting one or more cycle (s). This 
subspace is called the cycle space of G. Note that the cycle space can be recognized as the set 
of all spanning even subgraphs (i.e., subgraphs where every vertex has an even degree). The 
matrix whose row is the basis of G"s cycle space is denoted by B G . Similarly to the cut space, 
we regard the cycle space as a linear code defined by the generator matrix B G . An important 
relationship between cut space and cycle space, stated below, is known: 
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Fact 1 The cycle space of G is the orthogonal complement of the cut space of G. 

This fact implies that the linear code associated with a cycle space is the dual code of that 
with the corresponding cut space, and vice versa. In the following argument, given an undirected 
graph G, C(G) and C _L (G) denote the linear codes defined by the generator matrices B G and 
A G respectively. We often use term "cutset of G" as the meaning of the codeword of C(G) 
associated with that cutset. The same usage is also applied for cycle spaces. 

3.2 Prom Cycle Space to Number of Perfect Matchings 

Given an undirected graph G = (V,E), we consider counting the number of perfect matchings 
of G. Since there is no perfect matching if the number of vertices is odd, we define 2n = \V\. 
Let m = \E\ for short. The degree of vertex v is denoted by d(v). First we focus on the case 
that G is an odd graph, i.e., a graph such that d(v) is odd for any v in V. The number of perfect 
matchings of odd graph G is related to G"s cycle space by the following lemma. 

Lemma 1 For any odd graph G, the number of perfect matchings in G is equal to W c ^[m — n]. 

Proof Let V = {vo,vi,--- , V2 n -i} be the set of vertices in G. We prove the lemma by 
defining a bijection between the set of codewords with weight m — n and perfect matchings. 
More precisely, we prove that the complement (in terms of the edge set of G) of any codeword 
w in C(G) with weight m — n is a 1-factor (equivalent to a perfect matching). Let G w be a 
spanning even subgraph corresponding to w. The degree of V{ G V in G w is denoted by d'(vi). 
To prove that the complement of G w is a 1-factor, it suffices to show that d'(vi) = d(vi) — 1 
holds for any Vi G V. Suppose for contradiction that d'(vi) / d(vi) — 1 holds for some Vi G V. 
Since d'(vi) < d(vi), d(yi) is odd, and d'{vi) is even (recall G w is a spanning even subgraph of 
G), we have d'(vi) < d(vi) — 1. To make ^JIZq d'(vi) = 2(m — n) hold, there must exist another 
vertex Vj satisfying d'(vj) > d(vj) — 1 => d(vj) = d'(vj). It contradicts the fact that d(vj) is 
odd. 

Combining the lemma above and Theorem 1, we obtain the following corollary: 

Corollary 1 Let G be an arbitrary odd graph. There exists an algorithm to count the number 
of perfect matchings in G with 0(mr(5m)) time provided that the weight distribution W c ±^ 
is available, where m is the number of edges in G and t(x) be the time required for arithmetic 
operations of two x-bit integers. 

Note that the absolute value of Krawtchouk polynomials has a trivial upper bound \K m (j, i)\ < 
poly(m)(j; 2 ) 2 < 2 2m +°( lo s m ), and the number of all codewords of C ± (G) is at most 2 n < 2 m . 
Thus, the time required for each arithmetic operation in the right term of formula 1 is bounded 
by r(5m). 

3.3 Transformation to Odd Bipartite Graph 

While the result in the previous subsection assumes that G is an odd graph, that assumption 
can be easily removed. The fundamental idea is to construct the odd graph G that has the same 
number of perfect matchings as G. While we only consider the case that G is a bipartite graph 
in this paper, general graph can be handled similarly. Let G = (Vi U \>2,E), be an arbitrary 
bipartite graph such that |Vi| = | V2 1 = n, and V = V± U V2 for short. The set of even-degree 
vertices in V{ is denoted by Vj Cven (i G {1, 2}). We can easily show the following lemma: 

Lemma 2 The values of \Vi vcn \ and |V 2 even | have the same parity. 
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Figure 1: The construction of G 



Proof Assume that \E\ is odd. Since X/ueV reven ^( v ) i s even f° r an y * £ {1? 2}, X^i>gy\y evcn 
must be odd. Thus, |F \ V^ cvcn | is odd for any % € {1, 2} because any node in V \ V^ cvcn has an 
odd degree. It implies that |I^ even | is odd for any {1,2}. The case of even \E\ can be proved 
similarly. 

The construction of G is given as follows: 

• Add two vertices t^i, and 5j 2 to Vi for each i £ {1,2}. 

• For each i € {1,2}, connect each node in Vf vcn with £3-1,1, and £3-^1 with v^. 

• If d(vi t i) and ^(#2,1) are even, connect them. Recall that d(v\,\) and d{v2,i) have the same 
parity from Lemma 2. 

An example of the construction is shown in Figure 1. For the constructed graph G, we have 
the following lemma. 

Lemma 3 The graph G is an odd bipartite graph, and has the same number of perfect matchings 
as G. 

Proof Any node in G clearly has an odd degree. Let M C E be any perfect matching of G. 
Since and #2,2 is degree one, edges {^1,1,^2,2} and {^2,1, £1,2} are necessarily included in 
M. Then M\ {{^1,1, "V2,2}> {^2,i 5 ^1,2}} is a perfect matching of G. Conversely, given a perfect 
matching M' C E of G, G U {{^1,1, ^2,2}; {^2, 1^1,2}} is a perfect matching of G. Thus, we have 
a one-to-one correspondence between the perfect matchings of G and those of G. The lemma 
is proved. 

4 Computing Weight Distribution 

As seen in the previous section, the computation of the cut weight distribution for graph G 
induces the number of perfect matchings of G. Thus, in what follows, we focus on algorithms 
for computing the cut weight distribution. 

The set of edges constituting a cut is associated with a partition of all vertices: A partition 
(5, V \ S) of all vertices V induces a cutset, which is the set of edges crossing between S and 
V \ S. Thus we often use the sentence "partition (S 1 , V \ S) of V" as the meaning of the cut 
associated with that partition. We define c(S,T) to be the set of edges crossing two disjoint 
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subsets S and T (S,T C V). In particular, if S (resp. T) is a singleton {v}, we use notation 
c(v,T) (resp. c(5, u)). 

While two different partitions can lead the same cutset (e.g., (S, V \ S) and (V \ S, S)), it is 
well-known that exactly 2 d subsets induce the same cutset, where d is the number of connected 
components of G and equal to n — iank(A G ). Thus, instead of computing Wq±/q\, we rather 
consider the cut-weight distribution Wq±/q\ over all partitions, that is, I^±((jJ&] = \{S C 
y||c(5, V \ S)\ = k}\. It is easy to calculate W C ±( G ^ from W c± ^ because of the relation of 
W CHG) = 2-*.W> CHG) . 

4.1 0*(2 n )-time Algorithm 

A straightforward way of computing W c± ^ is to enumerate all partitions of V with computing 

their weights, which trivially takes 0*(2 2n ) time. In the case of bipartite graphs, we can reduce 
the time required for computing the cut-weight distribution. As a first step, this subsection pro- 
poses an 0*(2 ra )-time algorithm, which has the same performance as Ryser's one [25] (in terms 
of the base of the exponential part). Further improvement of the running time is considered in 
the following subsection. 

Let G = (Vi U V2,E) be the input bipartite graph such that \V\\ = | V2 1 = n and \E\ = m, 
and V = V1UV2 for short. For weight-distribution vector W and integer value x € [—m, m], we 
define a x (W) as the vector obtained by shifting each element of W x times. That is, 

{0 if i < x, 

W[i-x] tfn-l>i>x, 
if i > n + x. 

Note that the case of i < x or i > n+x applies only when x is positive or negative respectively. 
Let V be a subset of V. We say that partition (S, V \ S) is conditioned by a subset partition 
(S', V \ S') if S D S' and (V \ S) D (V \ S') holds. Let V s >\v' be the set of all partitions of 
V conditioned by (£', V \ S'), and Wgnv be the cut- weight distribution over all partitions in 
Vs'\v- Our algorithm relies on the fact that can be computed within polynomial time in 

n provided that a partition (S 1 , V\ \ S) of V\ is given. In the following argument, we introduce 
an arbitrary ordering vo, v±, ■ ■ ■ v n -\ of vertices in V2. We define V' 1 = {v^, • • • v n -\} U V\. 
The lemma behind the correctness of our algorithm is stated below: 

Lemma 4 For a given partition (S, V±\S), let I = \c(vi, V\ \ S) \ — |c(vj, S)\. Then W s \ V i+i = 
W S \ V i +ai(W S \ V i) holds. 

Proof From the definition of Wgiyi, W5|yi+i = W Su ^ Vt ^ V i + W s \ V i clearly holds. Thus it 
suffices to show W SU { v .}\ V i = oi(W s \yi). Let (S',V \ S') be a partition in Vs\v^ an d k be 
its weight. By adding Vi to S', the weight increases by I. That is, the weight of partition 
(S'u{vi}, V\(S'U{vi}) is k+l. It implies a one-to-one correspondence between the partitions in 
Vs\yi with weight k and those in Vs U { v .}\yi with weight k + l. Hence we have Wgy/^iiyiffc+Z] = 
IVg|yi[£i] for any k. It clearly follows W'su{t;i}|V' < = <J i^Ws\v i )- Tbe lemma is proved. □ 

The recursive formula in Lemma 4 trivially allows us to compute W^vi = W s \ V n-i within 
polynomial time in n. For the usefulness of the following argument, we encapsulate this recursion 
process by function shift shown in the pseudocode of Algorithm 1. Let L : 2 Vl — > Z^ 1 ' be the 
function such that L(X)[i] = \c(vi, V\ \ X)\ — \c(vi,X)\ holds for any Vi £ Vi- Our 0*(2 n )-time 
algorithm computes and sums up the values of sh'tf^W x \v° ■> L(X)) over all partitions of V\. 
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Algorithm 1 shift: Function for computing \Yx\v™- 1 



1: 

2: 
3: 
4: 
5: 
6: 
7: 



function shift(W,L) 

while L is not empty do 
I <— the head of L 
Remove the head of L 



/* W € N m and , L £ N* */ 



W <- W + ai(W) 



endwhile 
return W 



That is, our algorithm computes the right side of the following equality: 



W, 



CHG)= E shift(WVo,L(S)). 



(2) 



scvi 



The correctness of this formula is obvious from the definition of Ws^. 



Theorem 2 There is an algorithm computing W' 



CHG) 



with 0*(2 n ) time. 



4.2 Function shift as a Linear Transformation 

Before introducing the faster algorithm, we show several properties of Function shift. Let 
H = {hij} € W nxm be the matrix defined as hij = 1 if j = i + 1 and otherwise. It is easy to 
check this matrix works as the operator i.e., for any m-dimensional vector W, WH X = a x (W) 
holds. Hence we can describe function shift(W, L) for a given sequence L = (lo,h, • • • l n -i) as 
follows: 



where / be the m x m identity matrix. We can obtain the following lemma: 

Lemma 5 Letting L and L' be two sequences of integers, and W±, W2 € N m . Then the following 
properties hold: 

1. <7 x (shift(W;L)) =shift(<7 x (W),L), 

2. shift(shift(W, L), U) = shift(W, L o V), 

3. a x {W x + W 2 ) = a x {W r ) + a x (W 2 ), and shift(Wi + W 2 , L) = shifty, L) + shift(W 2 , L), 
where o is i/ie concatenation of two sequences. 

Proof Since a x (W) = shift(W, (x)), we can treat a x equivalently to shift. Clearly, Equation 3 
implies that shift(*,L) is a commutative linear transformation. Thus all properties obviously 
hold. □ 




(3) 
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4.3 Improving Running Time 

In this subsection, we consider an improvement of 0*(2 n )-time algorithm. The running time 

of the improved algorithm is 0*(2 < - 1 ~ 5A1 °s A ' >n ) and consumes exponential space, where A is the 
average degree of the input graph. 

The underlying principle of the improved algorithm is very simple: Separating two smaller 
subproblems. Let (Ti,U\) be a partition of V\ (i.e., T\ = V\ \ U\) fixed by the algorithm, 
N(Ui) C V 2 be the set of vertices adjacent to U±, and vq,v\,--- f n -i be an arbitrary ordering of 
V 2 such that the last \N(U{)\ vertices correspond to N(U\). The cardinality of N(U\) is denoted 
by h for short. Now we consider the situation where U\ and T\ are partitioned into (X, U\ \ X) 
and (Y, T\ \ Y). If we regards X and Y as variables, the first n — h entries (lo,h, ■ ■ ■ l n -h) of 
L(X U Y) become a function of X, which are independent of the value of Y. In contrast, the 
last h entries (l n -h, l n -h+ii ' ' ' ln-i) are a function of both X and Y. Consequently, by two 
appropriate functions L T : 2l Tl l -> Z n ~ h and L v : 2 Tl x 2 Ul Z h , the sequence L(X U 1") can 
be described as follows: 

L(X,Y) = L T (X)oLu(X,Y). 
Then the following lemma holds: 

Lemma 6 L V {X, Y) = L V {X, 0) + L[/(0, Y) - L [/ (0, 0). 

Proof We prove Lu(X,Y)[i] = L v (X,9)[i] + L v (Q,Y)[i] - L(0,0)[i] for any i. Since IC^ 
and Y C U\ are mutually disjoint, the sets of edges c(vi,X) and c(vi,Y) are mutually disjoint. 
Thus we have \c(vi, X UY)\ = \c(vi,X) \ + |c(«j,y)|. Similarly, we have \c(vi,V\ \ (^UY))| = 
\c(v i ,{T 1 \X)ll(U 1 \Y))\ = \c(vi,(Ti\X))\ + \c(vi,(Ui\Y))\. Then we can obtain the following 
equality: 

Lu(X,Y)[{\ = |c(«i,Fi\Xuy)| -\c(vi,(XUY))\ 

= IcK^TAX))! - ic^,^)! + |c(^,(^ \y))| - \c( Vi ,Y)\ 

= Hvi,^ \X))\ - 1^,^)1 - \c( Vi ,X)\ + \c( Vi , (VI \Y))\ - \c(vM)\ - \c( Vi ,Y)\ 
= Lu(X,$)[i] + Lu(fi,Y)[{\ - |c( Vi ,ri)| - |c(«i,^i)| 
= L V (X, 0)[i] + Lu($,Y)[i] - | c ( Vi ,Ti U 

= L[Kx,0)[i] + L[K0,y)[i] -M0.0M 



The lemma is proved. □ 
The improved algorithm runs as follows: 

• (Step 1) We divide all partitions of T\ into several classes Cq,Ci, ■ ■ ■ C x such that for any 
two partitions {X u Ti \ X x ) and (X 2 ,T 1 \ X 2 ) in the same class, L u (X l ,®) = L(X 2 ,$) 
holds. 

• (Step 2) For each i € [1, x], we compute weight distribution Wj = Yl(x Ti\x)ed ^x\v n - h - 1 ■ 
(Note that Wi = E(x,T 1 \x)eC i sW\ft(W xlv o, L T (X)) holds.) 

• (Step 3) Let L(i) be the value of Ljj(X, 0) associated with class Cj and cy = |c(y, V 2 )\ for 
short. For each i € [0, x] and each partition (Y,U\ \ Y) of U\, we compute Lu(i,Y) = 
L{i) + Lu(®, y) -L[/(0, 0) and shift(cr Cy (Wj), y)). The sum of all the values returned 
by function shift is the output of the algorithm. 

We can show the following lemma, which directly leads the correctness of the algorithm: 
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Lemma 7 W£ X(G) = ELi Eyc^ shifty (Wi), Lf/(s, y)). 

Proof Since W x \v° is the distribution over singleton {(X, V° \-X")}, we have Wx|yo[z] = 1 for 
i = \c(X,V°\X)\ and otherwise. Thus, we have (t Cy (W x \ v o)[z\ = 1 for i = \c(X, V° \X)\ + c Y 
and otherwise. Since \c(X, V° \ X) \ + c Y = \c(X UY,V\(Xl)Y))\ holds, we obtain 

<*c(Y)(Wx\V») = W XuY \v°- ( 4 ) 
By using this equation, Lemma 5 and 7, we can obtain the following equality: 

X 

^2 E ^(<yc Y {Wi),Lu{i,Y)) 

i=l YCUi 

= EE Shift (°"cy ( W^| V „- h -l J ,Lj7(«, V) 

i=lYCUi \ \(X,Ti\X)eC i / 

a; 

= EE E shift (^(w^iv^-o.^^y)) 

»=i yci/x (x,Ti\x)ed 

X 

= EE E shift {<rc Y (.sh\fr(W xlv o,L T (X))),Lu(X,Y)) 

i=l YCUi (X,Ti\X)6Ci 
a; 

= E E E shift(shift( ( 7 Cy (W X | F o),L r (X)),L [/ (X,y)) 

i=i ycc/! (x,Ti\x)eCi 

a; 

= E E E sh\h{a CY (W xlv o),L T (X)oLu(X,Y)) 

i=l YCUi (X,Ti\X)ed 

= sMt(W XuYlv0 ,L(XuY)) 

XCTi,YCUi 

The lemma is proved. □ 

We focus on the running time of the algorithm. Clearly the first and second steps of the 
algorithm take 0*(2 n ~\ Ul \) time respectively. The third step requires time of 0*(x2\ Ul \). Thus 
the total running time is 0*(2 n ~^ + x2^). 

How small can we bound xl Clearly, it is upper bounded by the size of the domain of Ljj(X). 
From the definition, the value of Ljj(X)[i — (n — h)\ can take d{vi) + 1 different values for any 
Vi € N(Ui). It follows x < Y\vi€N(Ui)(^( Vi ) + -0- a PPly m g the arithmetic mean-geometric 
mean inequality, we can further bound x by ((YlvieN(Ui)(d(vi) + l))/\N(Ui)\)\ N ( Ul )\. Letting 
Ax be the average degree over X C V in G, we have 

x<(A m) + 1)1^)1. (5) 

We consider how to choose U\. Letting A be the average degree of G, V\ contains a subset X of 
n/5 vertices whose degrees are at most 5A/4. We choose n/(5AlogA) vertices from X as U\. 
For that choice we have \N{U\)\ < n/(41ogA). Since |A^(6 r i)|A A r(t/ 1 ) < An holds, we obtain 
Ajv(Ci) — 4AlogA. By assigning this bound to Inequality 5, we obtain 

x < (4A log A + l)4io g A < (4A i )4io g A = 0(2~e) 
Consequently, it follows that the running time of our algorithm is 0*(2^ 5Ai og A) ra ^ 
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Theorem 3 There is an an algorithm for counting perfect matchings of bipartite graphs which 
runs with 0*(2^ 5Ai og A) n ) time and exponential space. 

5 Related Work 

As seen in the introduction, we have roughly three lines about the studies on counting perfect 
matchings. We introduce the related work along them respectively. 

There has been proposed two different approach for approximating the number of per- 
fect matchings. The first one is the Markov-chain Monte Carlo method, which gives a fully- 
polynomial randomized approximation scheme (FPRAS) for counting perfect matchings [4, 14, 
15]. The second one is a randomized averaging of the determinant [8, 12, 16]. The fastest ap- 
proximation algorithm on this approach is one by Chien et.al. [8], which runs with 0(1. 2 n ) time. 
It is still an open problem whether there exists a FPRAS following this approach or not. 

The second line is the algorithm design for restricted inputs. A seminal work on this line is a 
polynomial-time exact counting algorithm for planar graphs [17]. As other restrictions, graphs 
of bounded genus [11, 27] or bounded treewidth [2, 24], and chordal graphs with its subclass [23] 
are considered. 

About the line of exact algorithms, we have already mentioned the results for bipartite 
graphs in the introduction. Thus we introduce only the work on counting perfect matchings 
for general graphs. A first result breaking the trivial 0*(2 m )-time bound is one by Bjorklund 
and Husfeldt [6], which has shown two algorithms: The first one runs with 0*(2 2n ) time and 
polynomial space, and the second rounds with 0*(1.733 2ra ) time and exponential space. These 
algorithms are similar with our result in the sense that it also reduces the problem into a counting 
over a different universe. A number of the following studies improve this bound [1, 5, 18, 21, 22]. 
The most recent and fastest one is the algorithm by Bjorklund [5], which achieves the same 
running time as Ryser's algorithm (that is, currently we do not find the difference of inherent 
difficulty between bipartite and general graphs). About time complexity, Dell et.al. [9] has 
shown that any algorithm has an instance of m edges incurring Q(exp(m/ log m)) time if we 
believe that a counting version of the Exponential Time Hypothesis [13] is true. 

6 Concluding Remarks 

In this paper, we presented a new algorithm for the problem of counting perfect matchings, 
which has an improved time bound depending on the average degree A of the input graph. 
Compared to previous results, our algorithm runs faster for many cases. In particular, the 
performance degradation to the increase of A is quite slower than the previous algorithms. The 
main idea of our algorithm is a new reduction to computing the cut-weight distribution of the 
input graph. Our algorithm is designed by combining this reduction with a novel algorithm for 
the computation of cut-weight distribution. The approach itself is quite new, and may be of 
independent interest. Finally, we conclude the paper with several open problems related to our 
approach. 

• Can we achieve the running time exponentially faster than Ryser's one by designing a 
faster algorithm computing cut-weight distribution? 

• The reduction part of our result is directly applicable to any graph (which may not be 
bipartite). Can we use the reduction to obtain a faster algorithm for general graphs? 
Actually, letting 1(G) be the independent sets of the input graph G, we can easily obtain 
an algorithm with 0*(2 2n ~l 7 ( G )l) running time by regarding G as a "quasi" bipartite graph 
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of two vertex sets 1(G) and V\I(G) and applying our 0*(2 n )-time algorithm, which gives 
the same performance as the algorithm by [1]. 

• Is it possible to design a faster FPRAS for counting perfect matchings based on our 
method? Note that an (l+e)-approximation of the cut-weight distribution trivially induces 
an (1 + e)-approximation of the number of perfect matchings because of the linearity of 
the transformation. 

• Computing cut-weight distribution is a special case of the counting version of 2-CSP, 
which is addressed by Williams [28]. In this sense, our reduction gives a new linkage 
from counting perfect matchings to CSP. Can we use this linkage for obtaining some new 
complexity result around those problems? 

• Can we apply the same technique to other combinatorial problems? Interestingly, there 
has been proposed a variety of MacWilliams-style Identities in the field of the coding 
theory. We may find a useful transformation from those resources. In addition, it may be 
an interesting approach to focus on the primal-dual relationship of two universes. Can we 
design a kind of primal-dual algorithms for counting problems? 
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